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Abstract 

Here we present a theoretical study on the main properties of FIEGARCH processes. We 
analyze the conditions for the existence, the invertibility, the stationarity and the ergodicity of 
these processes. We prove that, if {X t }tez is a FIEGARCH(p, d, q) process then, under mild 
conditions, {ln(A 2 )} teZ is an ARFIMA(g, d, 0) process. The convergence order for the poly- 
nomial coefficients that describes the volatility is presented and results related to the spectral 
representation and to the covariance structure of both processes {ln(A 2 )} te ^ and {ln(of )} te z 
are also discussed. Expressions for the kurtosis and the asymmetry measures for any station- 
ary FIEGARCH(p, d, q) process are also derived. The h-step ahead forecast for the processes 
{A t }t e z, {ln(cr 2 )} te z and {ln(X 2 )} te z is given with their respective mean square error forecast. 
The work also presents a Monte Carlo simulation study showing how to generate, estimate and 
forecast based on five different FIEGARCH models. The forecasting performance of ARCH-type 
models is compared through an empirical application to Brazilian stock market exchange index. 

Keywords. Long-Range Dependence, Volatility, Stationarity, Ergodicity, FIEGARCH Pro- 
cesses. 
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1 Introduction 

Financial time series present an important characteristic known as volatility which can be de- 
fined/measured in different ways but it is not directly observable. A common approach, but not 
unique, is to define the volatility as the conditional standard deviation (or the conditional variance) 
of the process and use heteroskedastic models to describe it. 

ARCH-type models, proposed by Engle (1982), constitute one of the main classes of econometric 
models used for representing the dynamic evolution of volatilities. Another popular one is the class 
of Stochastic Volatility (SV) models. A drawback of this class is that the volatility is specified as a 
latent variable which is not directly observable and this can make the estimation challenging. By 
ARCH-type models we mean not only the ARCH(p) model proposed by Engle (1982), which char- 
acterizes the volatility as a function of powers of past observed values (consequently, the volatility 
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can be observed one-step ahead), but also the several generalizations that were lately proposed to 
properly model the dynamics of the volatility. 

Among the generalizations of the ARCH model are the so-called Fractionally Integrated Expo- 
nential GARCH (FIEGARCH), introduced by Bollerslev and Mikkelsen (1996), and Fractionally 
Integrated GARCH (FIG ARCH), proposed by Baillie et al. (1996). These models generalize, re- 
spectively, the EGARCH model (Nelson, 1991), and the GARCH model (Engle and Bollerslev, 
1986), and they capture the effects of long-memory characteristic in the conditional variance (see 
Doukhan et al., 2003). 

FIEGARCH models have not only the capability of modeling clusters of volatility (as in the 
ARCH and GARCH models) and capturing its asymmetrjj^] (as in the EGARCH models) but they 
also take into account the characteristic of long memory in the volatility (as in the FIGARCH 
models, with the advantage of been weakly stationary if d < 0.5). Besides non-stationarity (in the 
weak sense), another drawback of the FIGARCH(p, d, q) models is that we must have d > and the 
polynomial coefficients in its definition must satisfy some restrictions so the conditional variance 
will be positive. FIEGARCH(p, d, q) models do not have this problem since the variance is defined 
in terms of the logarithm function. 

Some authors argue that the long memory behavior observed in the sample autocorrelation 
and periodogram functions of financial time series could actually be caused by the non-stationarity 
property. According to Mikosch and Starica (1999), long range behavior could be just an artifact 
due to structural changes. On the other hand, Mikosch and Starica (1999) also argue that, when 
modeling return series with large sample size, considering a single GARCH model is unfeasible and 
that the best alternative would be to update the parameter values along the time. As an alternative 
to the traditional heteroskedastic models, Ming (2000) presents a regime switching model that, 
combined with heavy tailed distributions, presents the long memory characteristic. 

It is our belief that FIEGARCH models are a competitive alternative for modeling large sam- 
ple sized data, especially because it avoids parameter updating. Also, as we prove in this work, 
FIEGARCH processes are weakly stationary if and only if d < 0.5 and hence, non-stationarity can 
be easily identified. Moreover, Saadi et al. (2006) analyze the daily returns of the Tunisian stock 
market and rule out the random walk hypothesis. According to the authors, the rejection of this 
hypothesis seems to be due to substantial non-linear dependence and not to non-stationarity in 
the return series and, after comparing several ARCH-type models they concluded that a stationary 
FIEGARCH model provides the best fit for the data. Furthermore, Jayasuriya (2009) presents a 
sub period investigation of long memory and structural changes in volatility. The authors consider 
FIEGARCH models to examine the long run persistence of stock return volatility for 23 developing 
markets for the period of January 2000 to October 2007. No clear evidence that long memory 
characteristic could be attributed to structural changes in volatility was found. 

In practice, usually a simple FIEGARCH (p, d, q) model with p, q £ {0,1} suffices to fully describe 
the data. For instance, Saadi et al. (2006) and Ruiz and Veiga (2008), consider FIEGARCH(1, d, 0) 
models while Jayasuriya (2009) considers FIEGARCH(1, d, 1) models. However, for the time series 
considered in the working papers by Prass and Lopes (2011a, 2011b), higher values of p and q 
are needed to provide a good fit for the data. Thus, in this paper we present the main theoret- 
ical properties of FIEGARCH(p, d, q) processes, for any p, q > 0, extending the results known in 
the literature for the particular cases p,q G {0,1}. We also present a simulated study including 
generation, estimation and forecasting features of FIEGARCH models. Knowing the theoretical 
properties of these processes is fundamental since any estimator used to fit a model to a time series 

1 By asymmetry we mean that the volatility reacts in an asymmetrical form to the returns, that is, volatility tends 
to rise in response to "bad" news and to fall in response to "good" news. 
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will assume some properties such as stationarity and ergodicity. On the other hand, working with 
simulated data is important in order to analyze the finite sample behavior of the estimators. 

Although different definitions of FIEGARCH processes can be found in the literature (see, for 
instance, Zivot and Wang, 2005 and Palma, 2001), we consider in this work the formulation proposed 
by Bollerslev and Mikkelsen (1996), which depends on the function X(z) — ^?fy(l — z)~ d . The 
conditions for the existence of power series representation for this function and the behavior of the 
coefficients in this representation are investigated throughout this work. This study is fundamental 
not only for simulation purposes but also to draw conclusions on the autocorrelation and spectral 
density functions decay of the non-observable process {ln(of )}t e z and the observable one {ln(X 2 )} teZ . 
We also provide a recurrence formula to calculate the coefficients of this series expansion for any 
p,q > 0- This recurrence formula allows to easily simulate FIEGARCH processes. 

We consider the fact that, given a FIEGARCH(j>, d, q) process {X t }tez, it is immediate that 
{ln(of)} te z is an ARFIMA(g, d,p) process, and we analyze the existence of an ARFIMA represen- 
tation for the process {ln(AT 2 )} te ^. This is very useful since the parameter estimation for ARFIMA 
models is well known (see Lopes, 2008 and references therein) and, to the best of our knowledge, 
this result is absent in the literature. Another contribution of this work is to provide the expressions 
for the asymmetry and kurtosis measures of FIEGARCH(p, d, q) process, for all p, q > 0. This re- 
sult extends the particular case considered in Ruiz and Veiga (2008). The ARFIMA representation 
of {ln(cr 2 )} teZ and the result that any FIEGARCH process is a martingale difference with respect 
to the natural filtration {Ttjtez, where T t '■= &({Z s }s<t), are applied to obtain the h-step ahead 
forecast for the processes {X^t&'E and {A" t 2 } tg ^. We also present the /j-step ahead forecast for 
both {ln(er 2 )}i SZ and {ln(X 2 )} teZ processes, with their respective mean square error forecast. To 
the best of our knowledge, formal proves for these expressions are not given in the literature of 
FIEGARCH(p, d, q) processes. 

Despite the fact that the quasi-likelihood is one of the most applied method in non-linear process 
estimation, asymptotic results for FIEGARCH process are still an open question (see Palma, 2007)[^] 
Therefore, we consider here a simulated study to investigate its finite sample performance. Since it 
is expected that, the better the fit, the better the forecasting, we also investigate the fitted models' 
forecasting performance. 

The paper is organized as follows: Section [2] presents the formal definition of FIEGARCH pro- 
cess and its theoretical properties. We give a recurrence formula to obtain the coefficients in the 
power series expansion of the polynomial that describes the volatility and we show their asymp- 
totic properties. The autocovariance and spectral density functions of the processes {ln(er 2 )} teZ 
and {ln(X 2 )} teZ are also presented and analyzed. The asymmetry and kurtosis measures of any 
stationary FIEGARCH process are also presented. Section [3] presents the theoretical results regard- 
ing the forecasting. Section [4] presents a Monte Carlo simulation study including the generation 
of FIEGARCH time series, estimation of the model parameters and the forecasting based on the 
fitted model. Section [5] presents the analysis of an observed time series and the comparison of the 
forecasting performance for different ARCH-type models. Section [6] concludes the paper. 

2 FIEGARCH Process 

In this section we present the Fractionally Integrated Exponential Generalized Autoregressive Condi- 
tional Heteroskedastic process (FIEGARCH). This class of processes, introduced by Bollerslev and 

2 The asymptotic properties for the quasi-likelihood method are well established for ARCH/GARCH models (see, 
for instance, Lee and Hansen, 1994; Lumsdaine, 1996; Berkes et al., 2003; Berkes and Horvath, 2003 and Hall and 
Yao, 2003) and also for EGARCH models (see, for instance, Straumann and Mikosch, 2003). 
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Mikkelsen (1996), describes not only the volatility varying on time and the volatility clusters (known 
as ARCH/ G ARCH effects) but also the volatility long-range dependence and its asymmetry. 

Here, we present some results related to the existence, stationarity and ergodicity for these pro- 
cesses. We analyze the autocorrelation and the spectral density functions decay for both {ln(of)} teZ 
and {ln(Xf)} t £z processes. Conditions for the existence of a series expansion for the polynomial 
that describes the volatility are given and a recurrence formula to calculate the coefficients of this 
expansion is presented. We also discuss the coefficients asymptotic behavior. We observe that if 
{Xt}tez is a FIEGARCH(p, d, q) process then, {ln(erf)} teZ is an ARFIMA(g, d,p) process and we 
prove that, under mild conditions, {\n(Xf)} te z is an ARFIMA(q, d, 0) process with correlated inno- 
vations. We present the expression for the kurtosis and the asymmetry measures for any stationary 
FIEGARCH(p, d, q) process. 

Throughout the paper, given a £ EU {— oo, +oo}, f(x) — 0(g(x)) means that |/(a;)l < c\g(x)\, for 
some c > 0, as x — > a; f(x) = o(g(x)) means that f(x)/g(x) -> 0, as x — > a; f(x) ~ g(x) means that 
f(x)/g(x) — > 1, as x — > a. We also say that f(x) « g(x), as x — > oo, if for any e > 0, there exists 
xo £ HI such that \f(x) — g(x)\ < e, for all x > xq. Also, given any set T, T* corresponds to the set 
T\{0} and Ia(-) is the indicator function defined as 1a(z) = 1, if z € A, and 0, otherwise. 

From now on, let (1 — B) d be the operator defined by its Maclaurin series expansion as, 

A, — — 

where T(-) is the gamma function, B is the backward shift operator defined by B k {Xt) = X t _^, for 
all fcGlN, and the coefficients 5dk are such that S^o = 1 and 6 dtk ~i — S dtk ~i ( k ~l~ d ), for all k > 1. 



Remark 2.1. Note that expression (2.1) is valid only for non-integer values of d. When d G IN, 
(1 — B) d is merely the difference operator 1 — B iterated d times. Also, one observe that, upon 
rep lacin g d by — d, the operator (1 — B)~ d has the same binomial expansion as the polynomial given 



in (2.1 ), that is 



(1 - B)~ d = J2 *-d,jB j ■■= nd < kBk > ( 2 - 2 ) 

where -r^j = 5-d,j-, for all j £ IN. Moreover, n^k ~ wrfpF 33 ' as A; — )• oo (see Prass, 2008). Therefore, 
■n d k = 0(k d ~ 1 ), as k goes to infinity. 



Suppose that {Z^tez is a sequence of independent and identically distributed (i.i.d.) random 
variables, with zero mean and variance equal to one. Let a(-) and /?(•) be the polynomials of order 
p and q defined, respectively, by 

P P 9 9 

a{z) = Yjy-<*iY = l~^a lZ l and f3(z) = ^{-^ = 1 - ( 2 - 3 ) 
with ao = (3q = —1. We assume that (3(z) ^ 0, if \z\ < 1, and that a(-) and /?(•) have no common 

0(B) 



roots. These conditions assure that the operator ^il is well defined. 



Definition 2.1. The process {X^tei, is a Fractionally Integrated EGARCH process, denoted by 
FIEGARCH(p, d, q), if 

X t = a t Z t , (2.4) 



\n(af) =t j + ^-(l- B)- d g(Z t ^), for all t e Z, (2.5) 
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where w£E and <?(•) is defined by 

g(Z t ) = 9Z t +<y[\Z t \ - E(\Z t \)], for all t £ Z, with 9,-y £ R. 



(2.6) 



Remark 2.2. Note that, in Definition 2.1 no conditions on the parameter d are imposed. Necessary 
and sufficient conditions on the parameter d, to guarantee the existence of the stochastic process 
{ln(cr t 2 )}t 6Z , satisfying (2.5), are discussed in the sequel. Also notice that when d = 0, we obtain 



the well known EGARCH process. Moreover, the theory presented here can be easily adapted to a 
more general definition where one may consider 



ln(o?) = 



OO 

E 

k=0 



c k g{Zt-i), for all t £ 



where {ck}k£M ls a sequence of real numbers for which the process {ln(of)}t e z is well defined, 
{^}tez is white noise process with variance not necessarily equal to one and g(-) is any measurable 
function. 



Figure 2.1 (a) presents a simulated time series {X t }™ =1 , with n = 2,000 observations, derived 



from a FIEGARCH(0, d, 1) process. Figure 2.1 (b) and (c) present the conditional variance {cj}™ =1 
and the logarithm of the conditional variance {ln(af )}™ =1 , both related to the same time series as in 
Figure [211(a). For this simulation we set d = 0.3578, = -0.1661, 7 = 0.2792, oj = -7.2247 and 



0.3578, 6 = -0.1661, 7 = 0.2792, oj 
0.6860. These are the parameter values of the FIEGARCH model fitted to the Bovespa index 



log-returns in Prass and Lopes (2011b). 






(a) {x t m°° 



(b) K 2 }?£1° 



(c) {ln(a?)}?£° 



Figure 2.1: (a) Simulated FIEGARCH time series {X t y}_ x , with n = 2, 000 observations, (b) The conditional 
variance of {X t }™ =1 and (c) the logarithm of the conditional variance. 



Notice that the function <?(■), given by the expression (2.6), can be rewritten as 



g(z t ) = 



(0 + ~f)Z t -iE(\Zt\), Z t >0; 
(0-7)Z t -7E(|Z t |), Z t <0. 



This expression clearly shows the asymmetry in response to positive and negative returns. Also, it 
is easy to see that <?(•) is nondinear if 6 7^ and the asymmetry is due to the values of 6 ±7. While 
the parameter 8, also known in the literature as leverage parameter, shows the return's sign effect, 
the parameter 7 denotes the return's magnitude effect. Therefore, the model is able to capture the 
fact that a negative return usually results in higher volatility than a positive one. 

From Prass (2008) it is known that if {Zt}tez is a sequence of independent and identically 
distributed random variables with zero mean and variance equal to one, then {g(Zt)}t£i, is a strictly 
stationary and ergodic process. Moreover, if E(Z 4 2 ) < 00, it is also weakly stationary (consequently, 
it is a white noise process) with mean zero and variance cr^, given by 



6 



Theoretical Results on FIEGARCH Processes 



cr 2 g = 9 2 +1 2 - hn\Z \)} 2 + 20 7 E(Z O |Z O |). 

These properties are extremely important to prove the results stated in the sequel. 
From now on, let A(-) be the polynomial defined by 



(2.7) 



X(z) = 



a(z) 

W) 



(l-z)- d :=J2 x d,kz\ for all |^| < 1, 



(2.8) 



fe=0 



where «(•) and are defined in (2.3). Since it is assumed that /3(-) has no roots in the closed disk 
{z : \z\ < 1}, and also a(-) and /?(•) have no common roots, the function X(z) is analytic in the open 
disc {z : \z\ < 1} ( if d < 0, in the closed disk {z : \z\ < 1}). Therefore, it has a unique power series 
representation and the operator given in (2.5 ) can be rewritten as j^(l-B)~ d = J2kLo ^<i,kB k = X(B). 
This representation is more convenient and will be extensively used when characterizing the process 
{ln(a?)} tez . 



Theorem 2.1 bellow gives the convergence order of the coefficients \d,ki as k goes to infinity. 
This result plays an important role when choosing the truncation point in the series representation 
for simulation purposes. 



Theorem 2.1. Let A(-) be the polynomial defined by (2.8). Then, for all k £ IN, the coefficients 
Ad,fc satisfy 

1 a(l) 



Xd,k 



r(d)fc!- d 0(1)' 



as k — >• oo. 



(2.9) 



Consequently, Xd,k = 0(k d x ) ; as k goes to infinity. 

Proof: Denote j3(z)~ l by f(z). Since /?(•) has no roots in the closed disk {z : \z\ < 1}, one has 



P(z)- 1 := f(z) = jr f k z k , where f k = for all k e IN 



fe=0 



From expressions (2.2), (2.8) and (2.10) it follows that 

k— i 



oo min{p,A:} 

K z ) = J2[ Yl ( n d , k -i-jfj 

3=0 



(2.10) 



(2.11) 



k=0 i=0 



From (2.11), one has 



min{p,fc} fe— i 

X d ,k = (~ a ^ ( X! n d,k-i-jfj) , for all k G M. 



i=0 



3=0 



k—i 



In particular, X d ,k = y^(-aj) ( Hdjfk-i-j) , for all k > p. 

Moreover, since fk — > 0, as k — >■ oo, it follows that for all e > 0, there exists ko > 0, such that, 
for a given m > and for all k > ko, \nd,jfk-i-j\ < &, for all < j < m and < i < p. Hence, for 
k sufficiently large, 

p k — i 

i—0 j—rn+1 

Notice that, since TT d , k ~ r(d)fc l - d > as ^ ~~ ^ °°' one can cnoose m o such that tt^ ~ TTd,k-i ~ TTrfjj for 
all mo <m + l<j<k — i and < i < p. Consequently, 



i=0 



y k— i— (m+1) p oo 



i=0 



J'=0 
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However, £°1 h = fi 1 ) = ^(i) and 7rd ' k ~ r(d)fc 1 -'* ' as k ~^ 00 ' So ' we have 

a(l) 1 a(l) 

d,fe - 7r d , fc — ~ r(d)fc 1 - d /?(T)' 

It follows that Ad,fc — > and Xd,kk 1 ~ d — > fp)~f(Ty' as ^ ~~ ^ 00 • Hence, A^fc = 0(A; d_1 ), as — )• 00, 
which concludes the proof. I 



Proposition 2.1 presents a recurrence formula for calculating the coefficients \dk, f° r an fc G IN. 



This formula is used to generate the FIEGARCH time series in the simulation study presented in 
Section [H 



Proposition 2.1. Let A(-) 6e the polynomial defined by the expression (2.8). The coefficients \d,k; 
for all k G IN, are given by 



fe— 1 fc — i 



A d , = 1 cwid A d , fe = -a% + ^ Al ( £ Pj S d,k-i-j) , for all k>l, 



(2.12) 



i=0 j=0 



where the coefficients 5d : k> f° r M k G IN, are given in (2.1) and 

( OL m , if 0<m<p; ( (3 m , if < m < q; 

a* m := i and (3* m := i 

{ 0, if m > p { 0, if m > q. 



(2.13) 



Proof: Let A(-) be defined by (2.8). Consequently, 



00 

a(z) -^(z)(l-z) d (^A d , fe ^ 



(2.14) 



fc=0 



By defining /3Z as in expression (2.13), for all k G IN, and upon considering expression (2.1 ), observing 
that 5do = —1 = /3o, the right hand side of expression (2.14) can be rewritten as 



fc=0 



fc=0 i=0 



3=0 



am - *) d ( £ A ^ fe ) = [£(£ -^.^> fc ] ( £ ^ fe ) = £ [ £ ( - £ ^ 

A;__0 i=Q A;=0 
00 fe— 1 fc— i 

= [ Xd - k - £ A <^ ( £ pj^k-i-o 



k-i-j 



fc=0 



i=0 



(2.15) 



that the equality (2.14) holds if and only if, 



Now, by setting at as in expression (2.13), for all k G IN, from expression (2.15) one concludes 



fc-i 



—an = Ad and 



k—i 

£ A <« ( £ $d,k-i-jP* 

i=0 3=0 



for all k > 1. 



Therefore, expression (2.12) holds. It is easy to see that by replacing the coefficients A^fc, given 
by (2.12), in the expression (2.15), for all k G IN, we get 2^feLo( — a fc) zfe = a(.z),which completes the 
proof. I 



From Theorem 



2.1 



one concludes that {Xd,k}keTN £ ^ 2 if an d only if d < 0.5 and {A^fcjfcgN G ^ 
whenever d < 0. Therefore, the random variable ln(of ), for all i G Z, is finite with probability one, if 
and only if d < 0.5. Moreover, since {g(Zt)}te , z is a white noise process, it follows immediately that 
{ln(of )} te ^ has an ARFIMA(q, d, p) representation. This fact is very useful for forecasting purposes 
(see Section [3]). Also, the following properties hold 
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if d < 0.5, the process {ln(af)} teZ is stationary (weakly and strictly) and ergodic; 
if d G (—1,0.5) and ce(z) / 0, for \z\ < 1, the process {ln(of)} teZ is invertible; 
if d < 0.5, the autocorrelation function of the process {ln(of )} te z is such that 

P\ n / C 2\{h) ~ ch 2d ~ 1 , as h — >■ oo, 
where c / 0, and the spectral density function of the process {ln(of )} te z is such that 

-»AM2 rP- /„,n\\ 2 



. m _^ Ke- A )| 2 h _ iA| -2 d ^/«(1)V 
/inK)(A)-^ |/3(e _ 4A)|2 |l-e | ~^\j7I)J A 



2d , as A -)• 0, 



where = Var(g(Zo)) is given in (2.7); 



Remark 2.3. In the literature one usually find that an ARFIMA(p, d, q) process is invertible for 
\d\ < 0.5 (see, for instance, Brockwell and Davis, 1991). However, Bloomfield (1985) seems to be the 
first to notice that this range can be extended to d G (—1,0.5), for an ARFIMA(0, d, 0). Recently, 
Bondon and Palma (2007) show that this result actually holds for any ARFIMA(p, d, q). 

As a consequence of these properties we have the following corollary that extends the results in 
Nelson (1991), where only the case d = is considered. 



Corollary 2.1. Let {Xt}te% be a FIEGARCH(p, d, q) process, given in Definition 2.1 If d < 0.5, 

then {Xt}tez and {of }t e z « r e strictly stationary and ergodic processes. 



Proof: Suppose that d < 0.5. Then, the process {ln(of)} teZ is weakly stationary and the random 
variable ln(of ) is almost surely finite, for all It follows that of and Xt are also almost surely 

finite, for all t G Z. Since of = exp{ln(of )}, for all t G Z, and the process {ln(er 2 )} teZ is strictly 
stationary and ergodic, it follows that {of }tez is also a strictly stationary and ergodic process. 



From expressions (2.4), (2.5) and (2.8), one has 



X t = Z t exp{-(w + ^A d)Jfe 0(Z t _i_ k ))}, for all t G 



Z. 



fc=0 



Since {Zt}t£ , z is a sequence of i.i.d. random variables, it follows that {Xt\t^% is a strictly stationary 
and ergodic process. I 



Corollary 2.1 shows that {X t } t ^% and {of} te ^ are strictly stationary and ergodic processes. 
However, this does not imply weakly stationarity when the random variable for t G Z, is such 
that either its mean or its variance is not finite. Theorem 2.2 in Nelson (1990) considers the 
generalized error distribution (GED) function and proves the existence of the moment of order 
r > 0, for the random variables Xt and erf, for all t G Z, when the process {ln(of)} teZ is defined 
in terms of a square summable sequence of coefficients. Theorem |2.2| bellow shows that, whenever 
d < 0.5, the theorem 2.2 in Nelson (1990) also holds for FIEGARCH(p, d, q) processes. 



Theorem 2.2. Let {Xt}tez be a FIEGARCH(p, d, q) process, given in Definition 2.1 Suppose 9 
and^y are not both equal to zero andd < 0.5. Also suppose that {Zt}tez is a sequence of i.i.d. random 
variables with zero mean and variance equal to one, with GED(w) distribution function, where v > 1. 
Then, the processes {Xt}tez and {o~t}t<=z ar & such that E(AJ) < oo and E([of ] r ) < oo, for allt G Z 
and r > 0. 
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Proof: For any FIEGARCH(p, d, q) process, from expressions (2.5) and (2.8), ln(of ) can be rewrit- 
ten as 



ln(of) = ui + \d,kg(Z t -i- k ), for all t E 



(2.16) 



fe=0 



It is easy to see that the expression ( 2.16[ ) is equivalent to expression (2.1) of Nelson (1991). Indeed, 
one just needs to set at = to, for all t € Z, and (3k = ^d,k-i, for all k > 1. Moreover, by definition, 
the polynomial /?(•) has no roots in the closed disk {z : \z\ < 1}. Therefore, if d < 0.5, then 
Yl'kLo ^dk < 00 • Under these conditions, the result follows from Nelson (1991), theorem 2.2. I 



Since {ln(of )} te z is an ARFIMA process, its properties are well known. However, in practice, 
this process cannot be directly observed and frequently, knowing its characteristics may not help 



to identify a model that fits the data. On the other side, by expression (2.4), one can write 
ln(Xf) = ln(er t 2 ) + \n(Zf), for all t £ Z. Since the process {X^t^x is observable, {ln(X t 2 )} 4eZ is a 
known process. 

Perez and Zaffaroni (2008) and Hurvich et al. 



(2005) consider models such that Xt can be 



written as in (2.4), but at can have a more general definition than (2.5). While Perez and Zaffaroni 



(2008) consider maximum likelihood and Whittle's method of estimation in the class of exponential 
volatility models, especially the EGARCH ones, Hurvich et al. (2005) consider different semipara- 
metric estimators of the memory parameter in general signal plus noise models. In both cases, 
to obtain an estimator by Whittle's method, the authors consider the spectral density function of 
{ln(X 2 )} teZ . 



Here we consider only the case where Xt can be written as in (2.4), and at is defined through 
the expression (2.5) and we present some properties of the process {ln(X 2 )} tgZ such as stationarity 



and ergodicity. We also show that, under mild conditions, this process also has an ARFIMA repre- 
sentation. To the best of our knowledge, up to this date, no formal proves of these results are given 
in the literature of FIEGARCH(p, d, q) processes, specially the ARFIMA(g, d, 0) representation of 
{ln(X 2 )} teZ . 



Theorem 2.3. Let {Xt}te% be a FIEGARCH(p, d, q) process, given in Definition 2.1, If d < 0.5 
and E([ln(Z 2 )] 2 ) < oo ; for all t £ Z, then the process {\n(Xf)} te z is well defined and it is stationary 
(weakly and strictly) and ergodic. Moreover, the autocovariance function of {ln(X 2 )} teZ is given by 



lln(x 2 )( h ) = X! Ad <fc Vfc+H + Var ( ln (^t 2 )) I {o}W + Vh-i/CIzK/i), for all he Z. 



(2.17) 



fe=0 



where ai is given in (2.7) and K, — Cov(g(Z ), Iii(Zq)) . 



Proof: If d < 0.5, the random variable ln(cr 2 ) is finite with probability one, for all t € Z. If 
E([1h(Zq)] 2 ) < oo, one has E(| 1ii(Zq)|) < oo and the random variable | ln(Z 2 )| is also finite with prob- 
ability one, for all t G Z. Which implies that the random variable ln(X 2 ) is finite with probability 
one, for all t € Z. Therefore, the stochastic process {ln(X 2 )} te z is well defined. By definition, 
{ZtjteZ is a sequence of i.i.d. random variables and ln(er ( 2 ) and ln(Z 2 ) are measurable functions of 
{Z^tez- Therefore, the process {\n(Xf)} teZ is strictly stationary and ergodic (see Durrett, 1991). 

Since {ln(X 2 )} teZ is a strictly stationary process, to prove that it is also weakly stationary 
one only needs to show that E([ln(X 2 )] 2 ) < oo or, equivalently, Var(ln(X 2 )) < oo, for all t G Z. 
Notice that, if d < 0.5, {ln(of )} te z is weakly stationary which implies that Var(ln(cr 2 )) < oo and, if 
E([ln(Z 2 )] 2 ) = E([ln(Z^)] 2 ) < oo, then Var(ln(Z 2 )) < oo, for all t € Z. Moreover, from expressions 
(2.5) and (2.8), ln(cr 2 ) is a measurable function of {Z s } s <t-i 
independent, for all t & Z. Hence, 



It follows that, ln(of) and ln(Z 2 ) are 
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Var(ln(A 2 )) = Var(ln(cr t 2 )) + Var(ln(Z t 2 )) < oo, for all t G Z. 



To complete the prove we need to show that the autocovariance function 7i n (x=)(^)) f° r an h G Z, 
is given by expression (2.17). From the definition of ln(Xf), it follows that 



Cov(ln(A 2 +/l ),ln(A 2 )) = Cov( ln(a 2 +/l ), ln(of )) + Cov( ln(Z 2 +/l ), ln(Z 2 )) 

+ Cov(ln(a 2 + ,J,ln(Z 2 )) + Cov(ln(Z 2 +/l ),ln(a 2 )). 



(2.18) 



Since {g(Zt)}tei, is a white noise process, from expression (2.16) one concludes that 



Cov(ln(cr t 2 +h ),ln(cr 2 )) = cr 2 X d,k^d,k+\h\ > for all h G 



fc=0 



From the independence of the random variables ln(Zf), for all t £ Z, and from expression (2.16), 
we have 



n v I 0, if h ^ 0; . „ . [ Arf h_i/C , if ft. > 0; 

Cov(l„(^), 1„( Z »» - { Var(ln(z<2))i if ft ! „ and Cov(l„« k >, MZf) ) -| Q if h & Q ; 

where £ = Cov(g(Z Q ), ln(Z 2 )) . Since Cov(ln(Z 2 +/l ),ln(<7 2 )) = Cov(ln( ( r 2 _, l ) ! ln(Z 2 )), with u = t + h, 
one concludes that 

Cov(ln(a 2 + ,),ln(Z 2 )) +Cov(ln(Z 2 + , l ),ln( ( r 2 )) = Cov( ln(<7 t 2 +|h| ), ln(Z 2 )) = X di{h \_ 1 JCIz.(h). 



By replacing these results on expression (2.18) we conclude that the autocovariance function of 
{ln(A 2 )} teZ is given by Km. I 



Figure 2.2 (a) presents the theoretical autocovariance function of the process {ln(A 2 )} te z, where 



{X t }tez is a FIEGARCH(0, d, 1) process, with the same parameter values considered in Figures 2.1 
and 2.3. Figure 2.2 (b) shows the sample autocovariance functio n of the tim e se ries {lnpT 2 )} 2 ™ , 

(c) presents the 



2.1 



Figure 



2.2 



where {Xt}^® is the simulated time series presented in Figure 

sample autocovariance function of the time series {ln(r^)}" =1 , where {^}™ =1 is the Bovespa index 
log- returns time series (see Section [5]). By comparing the three graphs in Figure 2.2, one concludes 
that all three functions present a similar behavior. Since the sample autocovariance function is an 
estimator of the theoretical autocovariance function, it is expected that their graphics will have the 



same behavior. The similarity between the decay in the graphs in Figure 2.2 (b) and (c) indicates 



that a FIEGARCH model seems appropriate for fitting the Bovespa index log-returns time series. 

It is known that {ln(cr 2 )} te z is an ARFIMA(q, d,p) process. The following theorem shows that, 
under mild conditions, {ln(A t 2 )} te z is an ARFIMA(g, d, 0) process with correlated innovations. In 
what follows, let </>(•) be the polynomial defined by 



(z) :=f3(z)(l-z) d = J2<PkZ k , for \z\ < 1. 



(2.19) 



k=0 



Theorem 2.4. Let {Xt}tei, be a FIEGARCH (p, d, q) process, given in Definition 2.1 Suppose 
\d\ < 0.5 and E([ln(Z 2 )] 2 ) < oo. Then, {ln(A 2 )} teZ is an ARFIMA(g, d, 0) process givenby 



/8(B)(1 - B) d {\u{Xf) -u) = e t , for all t G Z, 
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(a) 7ln(X»)(0 ( b ) 7ln(x2)0) (C) 7ln(r»)(0 

Figure 2.2: (a) Theoretical autocovariance function of the process {ln(X 2 )} te ^, where {X t } t& z is a 
FIEGARCH(0, d, 1) process and (b) sample autocovariance function of a time series {ln(X|)}|£j° derived 
from that FIEGARCH(0, d, 1) process; (c) sample autocovariance function of the time series {ln(r 2 )}t£i 7 , 
where {r^Z 1 / is the Bovespa index log-returns time series. 



where {£t}tez is a stochastic process with zero mean and autocovariance function 7 £ (-) given by 



P P P oo 

a g/^2 a i a i-\h\ +/Cy^ai<?i>j + | fe | +1 +/C ai^_| h | +1 + c 2 fiifo-Wi */ < < p; 

i=\h\ i=0 i=|h| 



oo 

i=\h\ 



if \h\=p + l; 



if \h\>p + l, 



(2.20) 

with a 2 g given in (2.7), JC = Cov(g(Z ), ln(Z 2 )), ctj is given in (2.3), /or a// i 6 {0, • • • ,p}, cfik is 
defined in ( |2.19| ) ; /or k G IN, and of := Var(ln(Z 2 )). 



Proof: Let {A(}j e ^ be a FIEGARCH process. From expressions (2.4) and (2.5) we have 

P(JS){1 - B) d (\n{X?) -u) = e u for all t e Z, 

where 

e t = a(S). 9 (Z t _ 1 ) + /3(S)(l-6) d ln(Z f 2 ), for all t € Z. (2.21) 

In particular, if d > 0, we have /3(£)(1 - £) d w = and - B) d \n{X^) = e u for all t £ Z. 

Now, suppose that E([ln(Zp)] 2 ) < oo. Since {Zt}t£Z is a sequence of i.i.d. random variables and 
< |E(ln(Z 2 ))| < E(|ln(2g)|) < ^([ln^)] 2 )] 1 / 2 , one concludes that E(ln(Z t 2 )) = E(ln(Z 2 )) < oo, for all 
t E Z. Therefore, /3(B)(1 - S) d E(hi(Z t 2 )) = and a(B)E(g(Z t - 1 )) = 0. Consequently, E(e t ) = 0, for 
all t e Z. 

Let (/)(■) be defined by expression (2.19). Assume, for the moment, that Var(e^) < oo, for all 
t 6 Z. It follows that 



Cov( £ ^, £t )=Cov(a(S).g(Z t+ , l _ 1 ),a(6).g(Z t _ 1 ))+Cov(0(S)ln(Z 2 +/l ),0(6) ln(Z 2 )) 

+ Co V (a(B)g(Z t+h ^),<p(B)HZf)) + Cov( ( f>(B)HZ^ h ),a(B)g(Z^ 1 )). 



(2.22) 



Since {g(Z t )}t^z is a white noise process we have 
Cov(a(B)g(Z t+h ^), a(B)g(Z t _ 1 )) -- 



Vai(g(Z )) ^ <Xi<*i-\h\, if \h\<P\ 
i=\h\ 

0, if \h\>p, 
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which does not depend on t G Z. From the independence of the random variables Zt, for all t G 
one has 



Cov(a(B)ff(Z t+h _i), 0(B) ln(Z 2 )) 



/C 2J Ui&i-h+i, if h < 1; 
p 



=fc-l 



and 



0, 



0, if ft, > p + 1 

if ft<-(p+l); 



Cov^^ln^)^^)^^!)) = < 



/C y^cti</> i+h+1 , if -(p + 1) < /i < -1; 

i=|/i|-l 
P 

£ a^+Zi+i, if h>-l. 

i=Q 



where K. := Cov(f/(Z ), ln(Zg)) does not depend on t G Z. Also, from the independence of the random 
variables ln(Z 2 ), for all i G Z, we have 



Cov(0(B)ln(Z t 2 +h ),0(B)ln(Z t 2 )) = Var(ln(Z 2 )) £ &0 Hh |, 

i=|h| 

which does not depend on t G Z. 



for all fteZ, 



Therefore, all four terms in expression (2.22) do not depend on i G Z and expression (2.20) 
holds. Now, to validate expression (2.20) we only need to show that Var(ej) < 00, for all t G Z. 
Notice that, since E(ej) = 0, it follows that E(ef) = Var(e() = 7 e (0). Upon replacing h = in 
(2.22) one obtains 

■ ■ p p 00 

7,(0) - Var(.g(Z )) ]T a 2 + 2/C £ + Var(ln(Z 2 )) £ 2 . 

i=0 i=0 i=0 

By hypothesis, E([ln(Z^)] 2 ) < 00 and d G (-0.5,0.5). It follows that Var(ln(^)) < 00 and ££L $ < 
00. We also know that Var(g(Zo)) < 00. In order to show that JC < 00, notice that K, := 
Cov( 5 (Z ),ln(Z 2 )) = E( ff (Z )ln(Z 2 )) and, since E(Z$) = 1 and Var(ln(Z 2 )) < 00, from Holder's 
inequality, we have E(|Zo|) < 00 and E(1ii(Zq)) < 00. Then, from (2.6) it follows that 

E(g(Z )ln(Z%)) = 0E(Z o ln(Z o 2 )) + 7 E(|Z | ln(Z 2 )) - c, 

where c := 7E(|Z |)E(ln(Z^)) < 00. By using the fact that 2ab < a 2 + b 2 , for all a, b G E, one 
concludes that 

|E(Z t ln(Z 2 ))| < *[E(Z 2 )+E(ln(Z 2 ))] < 00 and |E(|Z t | ln(Z 2 )) | < \ [E(Z t 2 ) + E( ln(Z 2 ))] < 00. 

Hence E(g(Z ) Iii(Zq)) < 00 and, consequently, Cov(g(Z ),lri(Z J )) < 00 and 7 e (0) < 00. Therefore, 
the result follows. I 



Remark 2.4. From Theorem |2.4| it follows that all classical results related to the stationarity and 
ergodicity of an ARFIMA process (see Lopes, 2008 for ARFIMA processes with non-correlated 
innovations) can be extended to the process {ln(X 2 )} teZ , where {Ai} tg ^ is a FIEGARCH(p, d, q) 
process. 

The spectral density function of the process {ln(AT 2 )} teZ is given by (see Hurvich et al., 2005) 

/m(x ? )(A) = /in(„?)(A) + -5R(e- A A(A)) + / ln(z?) (A), for all A G [0,vr], (2.23) 
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where K. = Cov(g(Z ),ln(Zo)), 9t(z) is the real part of z and A(z) := A(e lz ). 

As an illustration, Figure [273] (a) shows the spectral density function of the process {ln(Xf)} te %, 
where {X t } teZ is any FIEGARCH(0, d, 1) process with d = 0.3578, 9 = -0.1661, 7 = 0.2792, 
oj = —7.2247 and /3% = 0.6860. Figure 2.3 (b) shows the periodogram function of the time series 

231 fa) 



{ln(X?)}™f , where {X t }^0 is t h e simulated time series presented in Figure 



Figure 2.3 



shows the periodogram function of the time series {ln(rf )}tl\ 7 , where {^jw^ is the Bovespa 



index log-returns time series. One observes that the graphs in Figure 2.3 (b) and (c) present similar 



behavior, indicating that a FIEGARCH model may be adequate to fit the data. 





( a ) /hx(x|)(" 



( b ) 4i(X?)(" 



( c ) 4i(r t 2 )(') 



Figure 2.3: (a) Theoretical spectral density function of the process {ln(X 2 )} te z, where {X t }t^i, is a 
FIEGARCH(0, d, 1) process with d = 0.3578, = -0.1661, 7 = 0.2792, cj = -7.2247 and ft = 0.6860, 

(b) periodogram function related to a time series {X t } 2 £™ derived from this FIEGARCH(0, d, 1) process; 

(c) the periodogram function related to the time series {ln(r^)}Jl\ 7 , where {rt]\™ is the Bovespa index 
log-returns time series. 



The following proposition presents the expression of the kurtosis and the asymmetry measures 
for any stationary FIEGARCH(p, d, q) process. This result generalizes the one in Ruiz and Veiga 
(2008) where the authors consider only the particular case p = and q = 1. 



Proposition 2.2. Let {Xt}t£z be a stationary FIEGARCH(p, d, q) process, given in Definition 2.1 
7/E(Zq) < oo ; the asymmetry measure of {Xt\tz% is given by 



A x = E(Z 3 ) A " 



rjE(e X p{5A d , fc3 (Z )}) 

fc=0 

00 „ , 

[nE(exp{A d , fc5 (Z )}) 



and, i/E(Zg) < oo ; the kurtosis measure of {Xf}tez is given by 



K x = HZo) 



nE(exp{2A d , fc (?(Zo)}) 

fc=0 

00 f 
[liE(e»p{Ad,fcff(2b)})]' 



where Xd,k are given in (2.8) and g(-) is defined by (|2.6|). 



Proof: L et { Xt}tez be any stationary FIEGARCH(j?, d, q) process and A(-) be the polynomial 
defined by ( |2.8| ). Since, by definition, a t and Z t are independent, for all t € Z, it follows that 

E(X[) = E(JfJ) = E(«t5)E(ZJ), for all t € Z and r > 0. 

In particular E(Z 2 ) = E(Z 2 ) = 1 and E(X t 2 ) = E(X^) = E(cr 2 Z 2 ) = E(cr 2 ). 
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Since {g{Zt)}t^'E is a sequence of i.i.d. random variables, from (2.5) it follows that 

oo 

:K) = eTj] E(exp{^A (i , fe3 (Z )}), for all r > 0. 



E( 



(2.24) 



k=0 



Also, it is known that E(Xo) = and E(.Z'o) = 1, for all i 6 Z. Therefore, the asymmetry and the 
kurtosis measures of {X t }t£i. are given, respectively, by 



nx?) n^mzf) n , ^ e(x 4 ) e^mz 4 ) 



[E(X t 2 )]3/2 [E(a?)]3/2 



[ELY 2 )] 2 " [E(a t 2 )] 2 



(2.25) 



Upon replacing (2.24) in (2.25) we concludes the proof. 



Figure [2~4| shows the theoretical value of the kurtosis measure, as a function of the parameter d, 
for any FIEGARCH(0,d,l) process, with Gaussian noise and parameters 9 = —0.1661, 7 = 0.2792, 
-7.2247 and (a) ft = 0.6860 (b) ft = —0.6860. The parameter values considered in Figure 2.4 



UJ 



(a) are the same ones (except for d) considered in Figure 2.1 (a). For the specific model considered 
in Figure 2.1 d = 0.3578 and the theoretical value of the kurtosis measure is 5.6733. The sample 



kurtosis value of the simulated time series presented in Figure 2.1 (a) is 5.3197, which is very close 
to the theoretical one. It is easy to see that, while in Figure 



2.4 



the kurtosis values increase 
exponentially as d increases, in Figure 2.4 (b) the kurtosis values decrease for —0.5 < d < 0.3 and 
increase for 0.3 < d < 0.5. 



(a) ft = 0.6860 (b) ft = -0.6860 

Figure 2.4: (a) Kurtosis measure of a FIEGARCH(0, d, 1) process as a function of the parameter d with 
= -0.1661, 7 = 0.2792, u = -7.2247 and ft = 0.6860; (b) Kurtosis measure of a FIEGARCH(0, d, 1) 
process with the same parameters as in (a) but with ft replaced by ft = —0.6860. 



3 Forecasting 



Let {X t }tez be a FIEGARCH (p, d, q) process, given in Definition 2.1, and {X t }™ =1 a time series 
obtained from this process. In this section, we prove that {Xt}tei, is a martingale difference with 
respect to the filtration {J^jteZi where Tt ■= cr({Z s } s < t ), and we provide the /i-step ahead forecast 



for the process {X^tew,- Since the process {ln(af)} teZ , defined by (2.5), has an ARFIMA(g, d,p) 
representation, the /i-step ahead forecasting for this process and its mean square error value can 
be easily obtained (for instance, see Lopes, 2008 and Reisen and Lopes, 1999). This fact is used to 
provide an fa-step ahead forecast for {\n(Xf)} te z and the mean square error of forecasting. We also 
consider the fact that E(X t 2 ) = E(of ), for all t € Z, to provide an /i-step ahead forecast for both 
processes, {X t 2 } teZ and {of }tez, based on the predictions obtained from the process {ln(cr 2 )} t6 z- 
The notation used in this section is introduced bellow. 
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Remark 3.1. Let Yt, for t G Z, denote any random variable denned here. In the sequel we consider 
the following notation: 

• we use the symbol """ to denote the /i-ahead step forecast defined in terms of the conditional 
expectation, that is, Yt+h = E(Y f+ft |J r t ). Notice that this is the best linear predictor in terms of 
mean square error value. The symbols "~" and """ are used to denote alternative estimators 
(e.g. Y t+h and Y t+h ); 

• for simplicity of notation, for the h-step ahead forecast of ln(Yt+h), we write ln(Yt+h) instead 
of l^Y+~h) (analogously for "~" and """); 

• we follow the approach usually considered in the literature and denote the /i-ahead step 
forecast Y? +h as if +h instead of Y t 2 +h . If necessary, to avoid confusion, we will denote the 
square of Y t +h a s (Yt+h) 2 (analogously for "~" and """). 



The following lemma shows that a FIEGARCH(p, d, q) process is a martingale difference with 



respect to {Tt\t&- This result is useful in the proof of Lemma 3.2 that presents the /i-step ahead 
forecast of X n +h, for a fixed value of n G Z and all h > 1, and the 1-step ahead forecast of A^ +1 , 
given T n . 



Lemma 3.1. Let {Xt}tez be a FIEGARCH(p, d, q) process, given in Definition 2.1 and T t 



a({Z s } s < t ). Then, the process {Xt}te% is a martingale difference with respect to {T t }teE- 

Proof: From definition, o~t is a J^-i-measurable function. Moreover, E(X t ) = E(E(X t |J r t _ 1 )) and 
E(X t \J' t -i) = E(cr t Zt|J r t_i) = CT t E(Zt|J" t _i) = 0. Therefore, the process {Xt}tei, is a martingale differ- 
ence with respect to {J r t}te^- ■ 



Lemma 3.2. Let {Xt}teij be a stationary FIEGARCH(p, d, q) process, given by Definition 2.1 
Then, for any fixed n G Z, the h-step ahead forecast of X n +h, for all h > and the 1-step ahead 
forecast of X^ +1 , given T n , are, respectively, X n +h = and X% +1 = of +1 . 



Proof: From Lemma 3.1 , a FIEGARCH(p, d, q) process is a martingale difference. It follows that 
X n+ h = ~E(X n +t l \J r n ) = 0, for all h > 0. From definition, W,(X^ +1 \T n ) = cr^ +1 . Therefore, the 
1-step ahead forecast of X^ +1 , given F n , is + 1- Moreover, if E(A 4 4 ) < oo, for all t G Z, then, this 
is the best forecast value in mean square error sense. I 

To obtain the h-step ahead forecast for X^ +h , notice that at and Z t are independent and so are 
of and Z\, for all t G Z. Moreover, E(Z 2 + J T n ) = E(Z 2 + J = 1, for all h > 0. It follows that 

X 2 n+h :=E(Xl +h \T n )=^l +h \J 7 n):=al +h , for all h > 0. 

While for ARCH/GARCH models, E(cr 2 i+ , 1 |J' t ) can be easily calculated, for FIEGARCH processes, 
what is easy to derive is the expression for the /i-step ahead forecast for the process {ln(<r 2 )}t eZ , for 
any h > 1. The expressions for ln(a^ +h ) := E(ln(cr 2 +fl )|J" t ) and for the mean square error of forecast 



are given in Proposition 3.1. We shall use this result to discuss the properties of the predictor 



obtained by considering af l+h := exp{ln(cr 2 , +/l )}, for all h > 0. 



Proposition 3.1. Let {Xt}tei, be a FIEGARCH(p, d, q) process, given by Definition 2.1. Then, 



the h-step ahead forecast ln(a% +h ) of\n(cr 2 +h ), given T n = a({Z t }t< n ) , n G IN, is given by 

oo 

ln(cr 2 + ,,) = u> + >>d,k+h-i g(Z n -k), for all h > 0. (3.1) 



fe=0 
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Moreover, the mean square error forecast is equal to zero, if h = 1, and it is given by 



h-2 



E([ln(a, 2 i+h ) - Ha 2 n+h )] 2 ) = a 2 g £ \% k , ifh>2, 



(3.2) 



fc=0 



where a 2 := E([g(Z )] 2 ) is given in (2.7). 



Proof: Let ln(cr 2 +/l ) := E(ln(cr 2 +/t )| F n ). Note that W,(g{Z t )\T n ) = E(g(Z t )) = 0, for all t > n, and 



E(5(Z t )|J r „) = g(Z t ), for all t < n. By the expression (2.16), one has 

oo oo 

+ ^ ^d,k g(Z n +h-l-k), 



ln(a 2 



n+h) 



^ A djfc 'E l (g{Z n+h ^ 1 -k)\ Fn) 



k=0 



k=h-l 



and expression (3.1) follows. 



Since ha.(a 2 l+h ) is a function of {5(^ s )}s<n+/i-i an d {g{Z t )} t <£z is a sequence of i.i.d. random 
variables with zero mean and variance a 2 := E([g(Z )] 2 ), we conclude that 



h-2 



M([\n(a 2 n+h )-\n(a 2 n+h )y 
and zero if h = 1. 



E 



h-2 



•n+h-l-k j 



fe=0 



d.k> if ft > 2, 



fe=0 



In practice, E(<7^ + JJ-t) cannot be easily calculated for FIEGARCH models and thus, a common 
approach is to predict o- 2 l+h through the relation cr 2 +h '■= exp{ln(<j 2 +h )} , with hx(a 2 l+h ) defined by 
(3.1), for all h > 0. As a consequence, a /i-step ahead forecast for X^ +h is defined as X 2 +h :— o 2 n+h 



and a naive estimator for ln(X 2 +h ) is obtained by letting 



ln(X 2 + J:=ln(^ + J=ln(^ +?1 ,)=lnK + J, for all h > 0. 



x2 



(3.3) 



From expressions (|2.4[) and (3.3), it is obvious that ln(X 2 +?1 ) is a biased estimator for \n(X 2 +h ), 



whenever W,(\n(Z 2 +h )) ^ 0. Proposition 3.2 gives the mean square error forecast for the h-step ahead 
forecast of ln(X 2 +h ), defined through expression (3.3). 



Proposition 3.2. Let ln(X^ +fc ), for all h> 0, be the h-step ahead forecast ofln(X^ +h ), given the 
filtration T n = a({Z s } s < n ) , defined by expression (3.3). Then, the mean square error forecast is 
given by 



h-2 



E([HX 2 n+h )-\n(X' 2 +h )Y) =a 2 g Y,*lk+mHZ 2 n+h )] ), where a 2 := E([ 3 (Z )] 2 ). 



from Proposition 3.1 



Proof: By expression (3.3), ln(X^ +h ) := ln(a^ +h ), for all h > 0. Thus, from expression (2.4) and 



we nave 



E([ln(X 2 + J- \n(X 2 l+h )} 2 ) = E([ln(X 2 + ,J - H<r 2 n+h )} 2 ) = E([ln(a 2 + J + \n(Z 2 l+h ) Ha 2 n+h )\ 



h-2 



= E([^A d , fc3 (Z n+ft _ 1 _ fc )+ln(Z 2 + j] ) 



fc=0 



(3.4) 



By expanding the right hand side of expression (3.4) and using the fact that {Z^tez is a sequence 
of i.i.d. random variables, the proposition follows immediately. I 



Remark 3.2. If the values of Xt and at are known only for t 6 {1, • • • , n} then, the /i-step ahead 
forecast ln(<r 2 +/l ) of ln(er 2 +/l ), is approximated by 
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+h 



^ Xd,k+h-l g(Z n -k), for all h > 0, 



fc=0 



and, by definition, the same approximation follows for ]n(X* +h ). It is easy to see that, in this case, 
the mean square error of forecast values for the processes {ln(er 2 )} teZ and {ln(X 2 )} teZ are given, 
respectively, by 



h-2 



•,k I > 



and 



E( [ Hal +h ) ln(al +h )] 2 ) = o* ( £ \\ k + X l 

k—0 k=n+h—l 
h — 2 oo 

E( [ hx{Xl +h ) - HXl +h )] 2 ) = of ( X l,k + E X l* ) + E ( [ HZl+h)] 2 ) , for all h > 0. 



fc=0 k=n+h— 1 

From Jensen's inequality, one concludes that 

:= exp{\n{al +h )} = exp{I](\n(<j 2 n+h )\T t )} < ^{a 2 n+h \T n ) := o* +fcl for all h > 0, 

so that E(o* - o 2 ^) = E(E(o^. A - *? +/ J.F„)) = E(o 2 +/l - *l +h ) < 0, for all h > 0. In fact, from 
(2.9) and (3.1), we have 

°° h oo 

°l+h := cxp{ln(<7£ +h )} =exp{w + 5^A«, )fc+h _iff(Z n _ fc )} e w = exp{E( ln(o 2 ))}. (3.5) 

Another /i-step ahead predictor for cr^ , ft can be defined as follows. Consider an order 2 Taylor's 
expansion of the exponential function and write 

a 2 l+h = exp{E(ln(a I 2 l+ J|J-„)} + [ln( CT 2 + J - E(ln(a 2 + J|.F„)] exp {E(ln(o 2 + J|J„)} 

+ \[H°l+h) -E(ln(^ +/l )|J-„)] 2 exp{E(ln(^ +fc )|J n )}+i2 n+ft , for all h > 0, (3.6) 



From expression (3.6), a natural choice is to define a /i-step ahead predictor for cr^ +h as 

ol +ft :=exp{E(ln(o 2 i+ 0|jr n )} + ^ (3.7) 
for all h > 0. 

From expressions (3.1) and (3.2) one concludes that <5"^ +/l and c^ +h are related through the 
equation 



^ 2 

a n+h 



exp{\n(al +h )} = a, 
. h-2 



2 

n+hi 



if /i = 1; 



h-2 



exp{ln(<r2 +1 )}(i+ ^J-^ ) = a^fl + -oJJ^A^ , if /» > 1. 



fc=0 



(3-8) 



fc=0 



Since o"t+i is a J^-measurable random variable, for all t E Z, we have E(<7^ +1 — cr^ +1 ) = E(d^ +1 
°n+i) = 0- From equation (3.6), we easily conclude that, for all h > 1, 

^ h-2 

- ol +h ) = -HRn+k) and E(a 2 n+h - o 2 + J = - (l + -a 2 g ]T A^)e(«t 2 + J - E(i?„ +/t ). 



fc=0 
^2 



Therefore, the relation between the bias for the estimators cr^ +h and o^+h * s g iven by 



h-2 



E(5£+ h - <rj+ h ) = H°l+h - °l+h) + E(o 2 + J (l + -o 2 >?d,k) > for all ft > 1 



fc=0 



In Section |4] we analyze the performance of through a Monte Carlo simulation study. 
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4 Simulation Study 

In this section we present a Monte Carlo simulation study to analyze the performance of quasi- 
likelihood estimator and also the forecasting on FIEGARCH(p, d, q) processes. The set of parameters 
considered in this study is the same as in Prass and Lopes (2011a) and Prass and Lopes (2011b), 



except for model M5 (see Table 4.1). Prass and Lopes (2011a) present a Monte Carlo simulation 
study on risk measures estimation in time series derived from FIEGARCH process. Prass and Lopes 
(2011b) analyze a portfolio composed by stocks from the Brazilian market Bovespa. The authors 
consider the econometric approach to estimate the risk measure VaR and use FIEGARCH models 
to obtain the conditional variance of the time series. We also present here the h step-ahead forecast, 
for h € {1, • • • , 50}, for the conditional variance of simulated FIEGARCH processes. 



4.1 Data Generating Process 



In the following, the notation Mi, for i £ {!.,■•• ,5}, indicates the simulated FIEGARCH(p, d, q) 
models. For each model we consider re = 1,000 replications, with sample size n £ {2, 000; 5, 000}. 
The value n = 2, 000 is the approximated size of the observed time series considered in Prass and 
Lopes (2011b). The value n = 5,000 was chosen to analyze the estimators asymptotic properties. 

For each i g {1, • • • ,5}, {Xi : t}" =1 denotes any time series, with sample size n, derived from the 
model Mi. To generate these time series we set Z ifi ~ JV{0, 1), for all i S {!,••■ ,5}, consider 



the expressions given in Definition 2.1 and the equality given in expression (2.8). Then, for each 
t€ {!,••■ ,5}, 



ln((7? t ) = u + |U (1 - B)- d g(Z ilt -x) = £ A,, A «/;/,, : i « ]T A d)fcff (Z i , 4 _ 1 ), 



for m large enough. We set m = 50,000 based on the result presented in Theorem 2.1 The 



parameters value for each model are given in Table 4.1 



Table 4.1: Parameters value for the models. 



Model 


Parameter 


d 


6 


7 




CXI 








/3s 


fa 


Ml 


FIEGARCH(2, d, 1) 


0.4495 


-0.1245 


0.3662 


-6.5769 


-1.1190 


-0.7619 


-0.6195 








M2 


FIEGARCH(0, d, 4) 


0.2391 


-0.0456 


0.3963 


-6.6278 






0.2289 


0.1941 


0.4737 


-0.4441 


M3 


FIEGARCH(0, d, 1) 


0.4312 


-0.1095 


0.3376 


-6.6829 






0.5454 








M4 


FIEGARCH(0, d, 1) 


0.3578 


-0.1661 


0.2792 


-7.2247 






0.6860 








M5 


FIEGARCH(1, d, 1) 


0.4900 


-0.0215 


0.3700 


-5.8927 


0.1409 




-0.1611 









From Theorem 12.11 we have 



a(l) 



r(d)k 1 ' d /?(!)' 



as k 



oo, 



and we conclude that \ d ^ k — o{k d ) and \ d , k = 0(fc d_1 ), as k goes to infinity. However, the speed of 
the convergence varies from model to model, as we show in Table 4.2 For simplicity, let Qi(-) and 
Q 2 (-) be defined as 



Oi(fc) 



and Q2(k) := A 



d.k 



a(l) 



r(d)fc!- rf j3{\) 



for all k > 0. 
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Table 4.2 presents the values of the coefficients \d,k, given in Proposition 2.1, for k G {0; 10; 



100; 1,000; 5,000; 10,000; 20,000; 50,000; 100,000}, for each simulated model Mi, i G {!,••• ,5}. 



Note that, for k > 5,000, the coefficient values decrease slowly. We also report in Table 4.2 Q\{k) 
and Q2{k) values for the correspondent A^fc value. Note that, for k G {10,000; 50,000; 100,000}, 
the value Qi(k) is very close to zero, for all models. Also notice that, while Q2(k) converges to 1 
faster for model Ml than for the other models. 

Table 4.2: Coefficients Xd,k an d the quotients Qi(k) and Q2(k), for different values of k, for all 
models. 



Ml 

^d.k QiW Q 2 {k)' 



M2 



M3 



M4 

Qi(fc) ggg 



M5 

^d.fc Qi(fc) Q2(fc) 




10 

100 
1,000 
5,000 
10,000 
25,000 
50,000 
100,000 



1.00000 
0.26537 
0.07167 
0.02015 
0.00830 
0.00567 
0.00342 
0.00234 
0.00160 



0.09426 
0.00904 
0.00090 
0.00018 
0.00009 
0.00004 
0.00002 
0.00001 



1.04410 
1.00173 
1.00017 
1.00003 
1.00002 
1.00001 
1.00000 
1.00000 



1.00000 
-0.09039 
0.01450 
0.00251 
0.00074 
0.00043 
0.00022 
0.00013 
0.00008 



0.05212 
0.00482 
0.00048 
0.00010 
0.00005 
0.00002 
0.00001 
0.00000 



-1.08434 
1.00292 
1.00027 
1.00005 
1.00003 
1.00001 
1.00001 
1.00000 



1.00000 
0.31434 
0.07844 
0.02106 
0.00843 
0.00568 
0.00337 
0.00227 
0.00153 



0.11647 
0.01077 
0.00107 
0.00021 
0.00011 
0.00004 
0.00002 
0.00001 



1.08789 
1.00576 
1.00056 
1.00011 
1.00006 
1.00002 
1.00001 
1.00001 



1.00000 
0.36874 
0.06738 
0.01517 
0.00539 
0.00345 
0.00192 
0.00123 
0.00079 



0.16178 
0.01297 
0.00128 
0.00026 
0.00013 
0.00005 
0.00003 
0.00001 



1.26414 
1.01350 
1.00129 
1.00026 
1.00013 
1.00005 
1.00003 
1.00001 



1.00000 
0.12291 
0.03897 
0.01207 
0.00531 
0.00373 
0.00234 
0.00164 
0.00115 



0.03977 
0.00408 
0.00041 
0.00008 
0.00004 
0.00002 
0.00001 
0.00000 



0.97189 
0.99720 
0.99972 
0.99994 
0.99997 
0.99999 
0.99999 
1.00000 



4.2 Estimation 



In this study we consider the quasi-likelihood method to estimate the parameters of FIEGARCH 
models for the simulated time series. Given any time series {X t }™ =1 , this method assumes that 
Xt\J-t-i, for all t G is normally distributed. The vector of unknown parameters is denoted by 



i 1 ={d;u>;6;\;a 1 ,--- ,a p ;^,--- ,0,)' eR p+q+i 
and the estimator f) of r] is the value that maximizes 

ln(£(0;x))=-"ln(27r)--£ 



ln(a t 2 ) + % 



(4.1) 



Since the processes {X t } t< i and {Z t } t< i are unknown, we need to consider a set I of initial 
conditions in order to start the recursion and to obtain the random variable ln(af ), for t G {1, • • • , n}. 



Then we use these estimated values to solve (4.1). For this simulation study we assume, as initial 
conditions, Zt = 0, of = a\ and Xt := <JtZt = 0, whenever t < 1, where b\ is the sample variance of 
{JTi}" =1 . This is the initial set suggested by Bollerslev and Mikkelsen (1996). The random variables 



i}, are then estimated upon considering the set Iq of initial conditions and the 

where n is 



n. 



ln(a t 2 ), for te {1, • 

known values {X t }™ =1 . The infinite sum in the polynomial A(-) is truncated at m 
the available sample size. 

|4.5| present the kernel distribution of the parameter estimators for each considered 



Figures 4.1 



model when n £ {2, 000; 5, 000}. These graphs help to illustrate the results presented in Table 4.3 



It is easy to see that, for most estimates, the density function is approximately symmetric. For some 
parameters, we notice the presence of possible outliers, see for instance the graphs for parameter 
d, from model Ml, and parameters with i G {2,3,4}, for model M2. Although the graphs for 
n = 2,000 and n = 5,000 are similar, one observes that, as expected, the observations tend to 
concentrate closer to the mean when n = 5, 000. 
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Figure 4.1: Kernel density function of the estimates for model Ml, for n G {2,000; 5,000}. 






Figure 4.4: Kernel density function of the estimates for model M4, for n £ {2,000; 5,000}. 
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Figure 4.5: Kernel density function of the estimates for model M5, for n G {2,000; 5,000}. 



For any model, % denotes the r\ estimate in the fc-th replication, where k € {!,••• ,re}, re 



1,000 and rj is any vector parameter given in Table [47Tj Table 4.3 summarizes the results on the 
parameter estimation procedure. The reported values are the mean fj, the standard deviation (sd), 
the bias (bias), the mean absolute error (mae) and the mean square error (mse) values, defined by 



^ re 

fj := — ^Vk, sd := 



re 

k=l 



\ 



— y~](Vk - V) 2 , bias := — Y^e fe , mae := — V" \e k \, and mse := — V" e\ 
re — ' re * — ' re z — ' re — ' 

k=l k=l k=l k=l 



where := % — r/, for k G {1, • • ■ , re}. 



From Table 4.3 we conclude that, given the models complexity, the quasi-likelihood method per- 
forms relatively well. Since model M2 presents more parameters than the other models, which im- 
plies a higher dimension maximization problem, one would expect that the quasi-likelihood method 
would present the worst performance in this case. However, the estimation results for model M2 
(d = 0.2391) and M4 (d = 0.3578) are similar and the quasi-likelihood method performs better for 
model M2 than for models Ml (d = 0.4495), M3 (d = 0.4312) and M5 (d = 0.4900). By comparing 
the mae and mse values, given in Table |4.3[ we conclude that the worst performance occurs for 
models Ml and M5. This outcome is explained by the fact that the parameter d is very close to the 
non-stationary region for model M5 and, for model Ml, not only p = 2 but also d = 0.4495, which 
implies a more complex model with stronger long-range dependence. The small bias values indicate 
that, for all parameters, the mean estimated value is very close to the true value. Although for 
n = 2, 000 the standard deviation of several estimates is high if compared with the mean estimated 
value, as expected, the estimators performance improves as the sample size increases. 
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Table 4.3: Estimation results. 



Sample Size (n) 



2,000 



5,000 



Model 



'/ 



fid 



bias 



i) 



bias 



Ml 



ft 



0.4495 
-0.1245 

0.3662 
-6.5769 
-1.1190 
-0.7619 
-0.6195 



0.4104 
-0.1255 

0.3651 
-6.5757 
-1.0211 
-0.7304 
-0.4659 



0.0743 
0.0230 
0.0480 
0.3955 
0.3776 
0.3097 
0.3784 



-0.0391 
-0.0010 
-0.0011 
0.0012 
0.0979 
0.0315 
0.1536 



0.0522 
0.0185 
0.0387 
0.3156 
0.2719 
0.2048 
0.2404 



0.0070 
0.0005 
0.0023 
0.1565 
0.1521 
0.0969 
0.1668 



0.4133 
-0.1255 

0.3653 
-6.5765 
-1.0273 
-0.7279 
-0.4766 



0.0776 
0.0220 
0.0458 
0.3916 
0.3664 
0.3035 
0.3719 



-0.0362 
-0.0010 
-0.0009 
0.0004 
0.0917 
0.0340 
0.1429 



0.0495 
0.0176 
0.0365 
0.3128 
0.2588 
0.1965 
0.2282 



0.0073 
0.0005 
0.0021 
0.1534 
0.1427 
0.0933 
0.1587 



M2 



Pi 
ft 
ft 



0.2391 
-0.0456 
0.3963 
-6.6278 
0.2289 
0.1941 
0.4737 
-0.4441 



0.2086 
-0.0461 
0.3933 
-6.6325 
0.2525 
0.1877 
0.4605 
-0.4535 



0.1188 
0.0239 
0.0497 
0.0918 
0.1479 
0.1082 
0.1204 
0.1302 



-0.0305 
-0.0005 
-0.0030 
-0.0047 
0.0236 
-0.0064 
-0.0132 
-0.0094 



0.0959 
0.0192 
0.0383 
0.0731 
0.0988 
0.0656 
0.0631 
0.0843 



0.0150 
0.0006 
0.0025 
0.0085 
0.0224 
0.0117 
0.0147 
0.0170 



0.2208 
-0.0459 
0.3949 
-6.6271 
0.2422 
0.1938 
0.4655 
-0.4483 



0.0833 
0.0148 
0.0323 
0.0663 
0.0978 
0.0819 
0.0859 
0.0732 



-0.0183 
-0.0003 
-0.0014 
0.0007 
0.0133 
-0.0003 
-0.0082 
-0.0042 



0.0615 
0.0117 
0.0250 
0.0518 
0.0631 
0.0421 
0.0398 
0.0489 



0.0073 
0.0002 
0.0010 
0.0044 
0.0097 
0.0067 
0.0074 
0.0054 



M3 



0.4312 
-0.1095 

0.3376 
-6.6829 

0.5454 



0.3703 
-0.1121 

0.3372 
-6.6954 

0.5839 



0.1474 
0.0227 
0.0433 
0.4058 
0.1508 



-0.0609 
-0.0026 
-0.0004 
-0.0125 
0.0385 



0.1040 
0.0180 
0.0347 
0.3253 
0.1203 



0.0254 
0.0005 
0.0019 
0.1648 
0.0242 



0.4148 
-0.1100 

0.3366 
-6.6942 

0.5582 



0.0577 
0.0142 
0.0272 
0.3516 
0.0782 



-0.0164 
-0.0005 
-0.0010 
-0.0113 
0.0128 



0.0458 
0.0113 
0.0218 
0.2865 
0.0625 



0.0036 
0.0002 
0.0007 
0.1238 
0.0063 



Mi 



0.3578 
-0.1661 

0.2792 
-7.2247 

0.6860 



0.3094 
-0.1698 

0.2794 
-7.2171 

0.7095 



0.1240 
0.0207 
0.0368 
0.2729 
0.1003 



-0.0484 
-0.0037 
0.0002 
0.0076 
0.0235 



0.0921 
0.0167 
0.0293 
0.2150 
0.0802 



0.0177 
0.0004 
0.0014 
0.0745 
0.0106 



0.3408 
-0.1667 

0.2787 
-7.2176 

0.6963 



0.0590 
0.0136 
0.0227 
0.2204 
0.0570 



-0.0170 
-0.0006 
-0.0005 
0.0071 
0.0103 



0.0453 
0.0108 
0.0182 
0.1750 
0.0461 



0.0038 
0.0002 
0.0005 
0.0486 
0.0034 



M5 



ai 

ft 



0.4900 
-0.0215 

0.3700 
-5.8927 

0.1409 
-0.1611 



0.4390 
-0.0228 

0.3704 
-5.9000 

0.1605 
-0.0817 



0.1602 
0.0314 
0.0500 
0.2418 
0.4387 
0.4680 



-0.0510 
-0.0013 
0.0004 
-0.0073 
0.0196 
0.0794 



0.1216 
0.0247 
0.0401 
0.1926 
0.3619 
0.3762 



0.0283 
0.0010 
0.0025 
0.0585 
0.1929 
0.2253 



0.4589 
-0.0214 

0.3702 
-5.8988 

0.1944 
-0.0608 



0.1092 
0.0188 
0.0334 
0.2137 
0.3916 
0.4370 



-0.0311 
0.0001 
0.0002 

-0.0061 
0.0535 
0.1003 



0.0765 
0.0150 
0.0268 
0.1705 
0.3157 
0.3311 



0.0129 
0.0004 
0.0011 
0.0457 
0.1562 
0.2010 



4.3 Forecasting 



In what follows we discuss the simulation results related to forecasting based on the fitted FIE- 
GARCH models. To access the models forecast performance, during the generating process, we 
create 50 extra values for each simulated time series. Those values are used here to compare with 
the n-step ahead forecast, for h G {1, ■ ■ • , 50}. For each replication, the /i-step ahead predictor for 
for all h G {!,-•• ,50}, is obtained from expression (3.7) upon replacing the true parame- 



u n+hi 

ters values r\ 

nK +h \Fn) = 

Table 
model Mi 



= (d;u;9;X;ai,--- ,a p ;/3i, 
E(al +h \T n ), we also set Xl 



4.4 



presents the mean over 1 



, f3 q )' by the estimated ones. 

^2 



Based on the fact that 



a n+hi f° r aU h > 0. 



000 simulated values of crf n+h 



the corresponding /i-step ahead predicted values of t+/l := Xf t+h 



for each i 6 {1, • • • , 5} 
and, in parenthesis, the mean absolute error of forecast (mae), for h€ {!,• 
origin n e {2, 000; 5, 000}. Figure 4.6 illustrates the results given in Table [474} 

10} and i G {1 



and Xf,n+h> 



obtained from 



, 10} and forecasting 



From Table 



4.4 



we conclude that, for of h G {1, • • • , 10} and i G {1, • • • ,5}, the predicted 
values are relatively close to the simulated ones, which is indicated by the small mae values. We also 
observe that the mae value increases as h increases. This result is expected and it is theoretically 
explained in Proposition 3.1 Table 4.4 also shows that, for Xf n+h , 



the mae is usually high, if 



compared to the mean simulated and mean predicted values. Therefore, we conclude that X, 



n-\-h 
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Table 4.4: Mean simulated values for crf n+h and Xf n+h , obtained from model Mi, i <G {1, • • • ,5}, and the 
mean predicted values af n+h = Xf n+h , for h £ {1, • • • , 10} and forecasting origin n <G {2, 000; 5, 000}. In 
parenthesis, the mean absolute error of forecast. 



n 




2,000 






5,000 




h 


Predictor 


®i,n-\-h 


x 2 + h 


Predictor 


^i,n-\-h 


i,n-\-n 








Ml (i = 










1 

2 
3 
4 
5 
6 
7 
8 
9 
10 



0.00183 
0.00185 
0.00180 
0.00181 
0.00179 
0.00180 
0.00179 
0.00179 
0.00179 
0.00179 



0.00182 
0.00182 
0.00179 
0.00185 
0.00185 
0.00184 
0.00182 
0.00181 
0.00178 
0.00181 



0.00014) 
0.00040) 
0.00054) 
0.00068) 
0.00072) 
0.00075) 
0.00073) 
0.00075) 
0.00074) 
0.00076) 



0.00165 
0.00197 
0.00186 
0.00194 
0.00190 
0.00181 
0.00175 
0.00179 
0.00183 
0.00183 



0.00169) 
0.00179) 
0.00185) 
0.00195) 
0.00195) 
0.00178) 
0.00183) 
0.00185) 
0.00186) 
0.00195) 



0.00181 
0.00178 
0.00179 
0.00176 
0.00177 
0.00176 
0.00177 
0.00176 
0.00177 
0.00176 



0.00180 
0.00172 
0.00176 
0.00177 
0.00179 
0.00176 
0.00172 
0.00173 
0.00176 
0.00175 



0.00013) 
0.00036) 
0.00053) 
0.00067) 
0.00068) 
0.00073) 
0.00068) 
0.00071) 
0.00070) 
0.00069) 



0.00163 
0.00167 
0.00185 
0.00174 
0.00162 
0.00148 
0.00166 
0.00177 
0.00173 
0.00188 



0.00165) 
0.00172) 
0.00188) 
0.00175) 
0.00170) 
0.00161) 
0.00172) 
0.00181) 
0.00180) 
0.00191) 



M2 (i = 2) 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 



0.00145 
0.00147 
0.00143 
0.00144 
0.00144 
0.00141 
0.00143 
0.00142 
0.00142 
0.00143 



0.00145 
0.00147 
0.00143 
0.00146 
0.00144 
0.00141 
0.00144 
0.00143 
0.00143 
0.00142 



0.00011) 
0.00032) 
0.00034) 
0.00036) 
0.00042) 
0.00041) 
0.00043) 
0.00043) 
0.00044) 
0.00042) 



0.00148 
0.00143 
0.00145 
0.00144 
0.00135 
0.00142 
0.00150 
0.00146 
0.00131 
0.00143 



0.00148) 
0.00144) 
0.00137) 
0.00139) 
0.00138) 
0.00147) 
0.00147) 
0.00144) 
0.00135) 
0.00144) 



0.00144 
0.00143 
0.00144 
0.00143 
0.00143 
0.00143 
0.00142 
0.00143 
0.00142 
0.00142 



0.00144 
0.00143 
0.00144 
0.00145 
0.00144 
0.00146 
0.00143 
0.00143 
0.00143 
0.00142 



0.00007) 
0.00031) 
0.00033) 
0.00037) 
0.00043) 
0.00043) 
0.00041) 
0.00042) 
0.00044) 
0.00042) 



0.00139 
0.00147 
0.00158 
0.00153 
0.00148 
0.00140 
0.00142 
0.00142 
0.00143 
0.00138 



0.00143) 
0.00145) 
0.00151) 
0.00145) 
0.00145) 
0.00136) 
0.00141) 
0.00144) 
0.00141) 
0.00142) 



M3 (i = 3) 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 



0.00158 
0.00157 
0.00157 
0.00157 
0.00157 
0.00157 
0.00158 
0.00158 
0.00158 
0.00159 



0.00157 
0.00158 
0.00159 
0.00157 
0.00157 
0.00158 
0.00157 
0.00157 
0.00160 
0.00162 



0.00009) 
0.00033) 
0.00042) 
0.00049) 
0.00055) 
0.00060) 
0.00060) 
0.00060) 
0.00064) 
0.00066) 



0.00167 
0.00152 
0.00145 
0.00155 
0.00161 
0.00152 
0.00151 
0.00170 
0.00170 
0.00173 



0.00158) 
0.00147) 
0.00154) 
0.00156) 
0.00166) 
0.00154) 
0.00152) 
0.00178) 
0.00170) 
0.00176) 



0.00151 
0.00151 
0.00151 
0.00151 
0.00152 
0.00152 
0.00152 
0.00152 
0.00152 
0.00152 



0.00150 
0.00150 
0.00151 
0.00151 
0.00151 
0.00151 
0.00152 
0.00152 
0.00150 
0.00151 



0.00005) 
0.00027) 
0.00041) 
0.00048) 
0.00050) 
0.00053) 
0.00057) 
0.00061) 
0.00059) 
0.00059) 



0.00147 
0.00149 
0.00151 
0.00151 
0.00153 
0.00147 
0.00149 
0.00150 
0.00148 
0.00143 



0.00139) 
0.00147) 
0.00150) 
0.00152) 
0.00150) 
0.00153) 
0.00156) 
0.00153) 
0.00147) 
0.00147) 



M4 (i = 4) 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 



0.00098 
0.00097 
0.00096 
0.00095 
0.00094 
0.00094 
0.00094 
0.00093 
0.00093 
0.00093 



0.00097 
0.00096 
0.00095 
0.00096 
0.00098 
0.00099 
0.00098 
0.00098 
0.00098 
0.00098 



0.00006) 
0.00019) 
0.00028) 
0.00034) 
0.00039) 
0.00044) 
0.00044) 
0.00045) 
0.00047) 
0.00047) 



0.00091 
0.00098 
0.00098 
0.00104 
0.00102 
0.00096 
0.00101 
0.00102 
0.00103 
0.00103 



0.00089) 
0.00094) 
0.00097) 
0.00100) 
0.00102) 
0.00098) 
0.00103) 
0.00106) 
0.00102) 
0.00103) 



0.00092 
0.00092 
0.00092 
0.00092 
0.00092 
0.00092 
0.00092 
0.00091 
0.00091 
0.00091 



0.00092 
0.00092 
0.00092 
0.00092 
0.00092 
0.00093 
0.00092 
0.00093 
0.00092 
0.00094 



0.00003) 
0.00018) 
0.00026) 
0.00031) 
0.00035) 
0.00037) 
0.00038) 
0.00041) 
0.00041) 
0.00044) 



0.00089 
0.00088 
0.00091 
0.00090 
0.00095 
0.00080 
0.00101 
0.00093 
0.00092 
0.00099 



0.00085) 
0.00085) 
0.00089) 
0.00091) 
0.00093) 
0.00083) 
0.00095) 
0.00099) 
0.00097) 
0.00101) 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 



0.00292 
0.00295 
0.00295 
0.00295 
0.00295 
0.00295 
0.00295 
0.00295 
0.00295 
0.00295 



0.00292 
0.00297 
0.00296 
0.00295 
0.00296 
0.00296 
0.00296 
0.00296 
0.00297 
0.00294 



0.00018) 
0.00059) 
0.00059) 
0.00061) 
0.00061) 
0.00061) 
0.00064) 
0.00061) 
0.00065) 
0.00063) 



0.00310 
0.00301 
0.00293 
0.00294 
0.00296 
0.00298 
0.00303 
0.00306 
0.00289 
0.00297 



M5 (i = 

0.00296) 
0.00290) 
0.00291) 
0.00288) 
0.00284) 
0.00295) 
0.00289) 
0.00298) 
0.00297) 
0.00309) 



5) 



0.00298 
0.00295 
0.00296 
0.00295 
0.00295 
0.00296 
0.00295 
0.00295 
0.00295 
0.00295 



0.00296 
0.00291 
0.00294 
0.00298 
0.00298 
0.00299 
0.00300 
0.00293 
0.00298 
0.00296 



0.00012) 
0.00056) 
0.00058) 
0.00061) 
0.00063) 
0.00061) 
0.00063) 
0.00061) 
0.00066) 
0.00064) 



0.00277 
0.00298 
0.00315 
0.00322 
0.00310 
0.00321 
0.00277 
0.00307 
0.00304 
0.00308 



0.00276) 
0.00299) 
0.00292) 
0.00308) 
0.00292) 
0.00304) 
0.00285) 
0.00307) 
0.00302) 
0.00299) 



24 



Theoretical Results on FIEGARCH Processes 



&n+h i s a P oor estimator for X^ +h . This result is not a surprise since the main purpose of FIEGARCH 
models is to estimate the logarithm of the conditinal variance of the process and not the process 
itself. 

As expected, in all cases, the models' forecasting performance improves as n increases. Notice, 
however, that the difference in the mae values, from n = 2,000 to n = 5,000, is small. This is so 
because the coefficients Xd,k converges to zero, as k goes to infinity. Therefore, it is expected that, 
for some m G M and any M > 0, using the last m or the last m + M known values to calculate the 
/i-step ahead forecast value for the process will not considerably change the results. 




Figure 4.6: Mean simulated values for crf n+h and Xf n+h , obtained from model Mi, i £ {1, • • ■ , 5}, and mean 
predicted values of n+/l = Xf n+h , for h € {1, • • • , 10} and forecasting origin n 6 {2, 000; 5, 000}. For each 
model the graphs correspond to the mean taken over 1,000 replications. 
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Figure 4.7 shows the mean simulated values of crf_ n+h and Xf n+h , for i G {!,••■ ,5}, and the 



corresponding predicted values, obtained from expression (3.7), for h G {1, • • ■ , 50} and forecasting 
origin n G {2, 000; 5, 000}. 
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Model M1, n = 5000 
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Figure 4.7: Graphs of the /i-step ahead forecast values X^ n+h = df n+h , for h € {1, • • • , 50} and forecasting 
origin n G {2, 000; 5, 000}, and the corresponding simulated values Xf n+h and crf n+h from model Mi, i € 
{!,••• ,5}. For each model the graphs correspond to the mean taken over 1,000 replications. 



From Figure 4.7 we observe that, for all models, the predicted value converges to a constant as 
h increases. This is expected since the /i-step ahead predictor is defined in terms of the conditional 
expectation. We also observe that, if we had considered the /i-step ahead predictor &f n+h instead 



of 

&i,n+h> ^he estimation bias would have been much higher. Indeed, from expression ( |3.5[ ), <jf n+h 
converges to e u w as h goes to infinity, where uj(i) denotes the parameter to for model Mi, for each 
i G {1, • • • , 5}. The values of u{i) are given in Table 



4.1 



and the values of e w W, for i G {1, • • • , 5}, 



are respectively 0.0014, 0.0013, 0.0013, 0.0007 and 0.0028. 
Notice now that equations (3.5) and (3.8) imply 
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-2 

u i,n+h 



^ h-2 



fe=0 



1 oo 1 rn 



(4.2) 



fe=0 



fc=0 



for any % 6 {1, ■ • • ,5} and m sufficiently large. By taking m = 50,000 in (4.2), the limits are 
0.00173, 0.00142, 0.00155, 0.00091 and 0.00294, respectively, for i 6 {1, • • • , 5}. Upon comparing 
these values with the limiting constants in Figure 4.7 we conclude that, in fact, expression (4.2) 
holds for all models. 



5 Analysis of an Observed Time Series 

This section presents the analysis of the Sao Paulo Stock Exchange Index (Bovespa Index or 
IBovespa) log-return time series. We consider the FIEGARCH model, fully described in this paper, 
and we compare its forecasting performance with other ARCH-type models. The total number of 
observations for the IBovespa time series is n = 1737. We consider the first 1717 observations to fit 
the models and we reserve the last 20 ones to compare with the out-of-sample forecast. 

Figure 5.1 (a) presents IBovespa time series {Pt}l=i, i n the period of January/1995 to Decem- 
ber/2001. We observe a strong decay in the index value close to t = 1,000 (that is, January 15, 



1999). This period is characterized by the Real (the Brazilian currency) devaluation. Figures 5.1 
(b) and (c) present, respectively, the IBovespa log-return time series, {rt}]™ ', and the square of the 
log-return time series, {rf}\™ , in the same period. Observe that the log-return series presents the 
stylized facts of financial time series such as apparent stationarity, mean around zero and clusters of 



volatility. Also, in Figure 5.2 we observe that, while the log- return series presents almost no correla- 
tion, the sample autocorrelation of the square of the log-return series assumes high values for several 
lags, pointing to the existence of heteroskedasticity and possibly long memory. Notice that the pe- 
riodogram of {ln(rf )}jl\ 7 , presented in Figure 2.3 (c), also indicates possibly long-memory in the 



conditional variance. Regarding the histogram and the QQ-Plot, we observe that the distribution 
of the log- return series seems approximately symmetric and leptokurtic. 





JUL 



(a) {PtVZf 



(b) {rt} 1 ™ 



(c) {^Ji 7 



Figure 5.1: Time series: (a) Bovespa index; (b) IBovespa log-returns; (c) square of the IBovespa log-returns, 
in the period of January/1995 to December/2001. 



The analysis of the sample autocorrelation function suggests an ARMA(pi, gi)-FIEGARCH(p 2 , d, q 2 ) 
model. While an ARMA model accounts for the correlation among the log-returns, a FIEGARCH 
model take into account the long memory (in the conditional variance) and the heteroskedasticity 
characteristics of the time series. 

Initially, we consider all possible models with pi,qi € {0, 1, 2, 3} and p 2 , qi € {0, 1, 2} and applied 
the quasi-likelihood method to estimate the unknown parameters. Then, we select the best models, 
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(a) (b) (c) (d) 



Figure 5.2: (a) Histogram; (b) QQ-Plot and (c) sample autocorrelation of the IBovespa log-return series and 
(d) sample autocorrelation of the square of the IBovespa log-return series. 



with respect to the log-likelihood, AIC, BIC and HQC criteria. After removing the non-significant 
parameters and performing the residual analysis we conclude that the best model is an ARMA(0,1)- 
FIEGARCH(0, d, 1). We compare the forecasting performance of this model with other ARCH-type 
models. 

Table 5.1: Fitted Models and their respective log-likelihood values. 



Estimate 


ARMA(0,1) + 


ARMA(0,1) + 


ARMA(0,1) + 


ARMA(0,1) + 


ARMA(0,1) + 




ARCH(l) 


ARCH(6) 


GARCH(1,1) 


EGARCH(0,1) 


FIEGARCH(0,d,l) 


01 


-0.1138 (0.0200) 


-0.0642 (0.0267) 


-0.0647 (0.0266) 


-0.0751 (0.0254) 


-0.0776 (0.0257) 


uj 


0.0004 (0.0000) 


0.0002 (0.0000) 


0.0000 (0.0000) 


-7.4694 (0.0969) 


-7.2247 (0.2143) 


&i 


0.6071 (0.0581) 


0.2307 (0.0417) 


0.2019 (0.0247) 






&2 




0.1540 (0.0333) 








&3 




0.1852 (0.0390) 








&4 




0.1145 (0.0348) 








&S 




0.0641 (0.0290) 








& 6 




0.0635 (0.0257) 








h 






0.7659 (0.0271) 


0.9373 (0.0103) 


0.6860 (0.0986) 


d 










0.3578 (0.0810) 











-0.1653 (0.0197) 


-0.1661 (0.0224) 


7 








0.2782 (0.0300) 


0.2972 (0.0332) 


log-likelihood 


3934.337 


4060.372 


4072.622 


4137.6251 


4138.5518 



Note: The number in parenthesis corresponds to the standard error of the estimate. 



The estimated coefficients for each model are given in Table 5.1, with the corresponding log- 
likelihood value. Notice that, the FIEGARCH model fitted to this time series present the same 
parameters values as model M4 considered in the simulated study in Section |4j Table 5^2 presents 



the mean absolute error (moe), the mean percentage error (mpe) and the maximum absolute error 
(maxae) of forecast, respectively defined as 

, 20 1 20 |e I 

™ ^ 2 + ' and max ae := max {|e n+fc |} 



h—l h^ 1 " ! " 



where, e n+ h := f, 2 1+/l — rf l+h , for he {1, • • • ,20} and n = 1717, is the /i-step ahead forecast error. 
Note that, from the ARMA(0,1) part of the models, r t = X t — 9iX t -i, where X t = a t Z t , for all 
t £ Z. Since we define f\ +h = E(r 2 +h |J" t ) and erf is Tt-\ -measurable, for all t G Z, by elementary 
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calculations we conclude that, ff l+1 



crl +1 + B\X 2 n and f 2 n+h 



'n+h 



for all h > 1, with 



&n+i = a n+x- For EGARCH and FIEGARCH models, (T n _i_^ is replaced by o n , ^ , given in expression 



(3.7), and af l+h := cxp{ln(a^ +h )} , where ln(a^ +h ) is defined in Proposition 



3.1 



Table 5.2: Mean absolute error (moe), mean percentage error (mpe) and maximum absolute error (max ae ) 



of forecasting for the models in Tabic 5.1 



Model 


ARMA(0,1) + 


ARMA(0,1) + 


ARMA(0,1) + 


ARMA(0,1) + 


ARMA(0,1) + 


ARCH(l) 


ARCH(6) 


GARCH(1,1) 


EGARCH(1,1) 


FIEGARCH(l,d,l) 


Predictor 


*2 






~2 *2 


^-2 -2 
a t+h a t+h 


mae 


0.00053 


0.00045 


0.00043 


0.00045 0.00044 


0.00045 0.00043 


mpe 


109.40844 


68.97817 


60.29677 


71.33057 61.26625 


68.42884 59.88066 


max ae 


0.00094 


0.00094 


0.00094 


0.00082 0.00087 


0.00084 0.00088 



Note: The high mpe values are due to 5 observations close to zero. 



From Table 5.2 we conclude that, given its high mpe value, the ARMA(0,1)-ARCH(1) does not 
fit the data well. In fact, the square of the residuals from this model are still correlated and 
we use the model only for comparison. The ARMA(0,1)-ARCH(6) model performed similar to the 
ARMA(0,1)-GARCH(1,1) model, in terms of both, mae and max ae values, presenting a higher mpe 
value. However, the latter is more parsimonious. Although the log-likelihood value is higher (and 
the max ae value is smaller) for the ARMA(0,1)-EGARCH(0,1) model, the mae and the mpe values are 
smaller for the ARMA(0,l)-GARCH(0,d,l) model. Overall, the ARMA(0,l)-FIEGARCH(0,d,l) performs 
slightly better than the other models. 

The fact that all models present a similar perfomance confirms the following, already known in 
the literature. 



• In practice, ARCH(p) models perform relatively well for most applications. 

• GARCH(p, q) models are more parsimonious than the ARCH ones. For instance, notice that 
similar results were obtained here by considering an ARCH(6) model and a GARCH(1, 1) 
model. 

• For EGARCH(p, q) models the conditional variance is defined in terms of the logarithm func- 
tion and less (usually none) restrictions have to be imposed during parameter estimation. 
Moreover, EGARCH models are not necessarily more parsimonious than ARCH/GARCH 
ones since it also carries information on the returns' asymmetry (6 and 7 parameters). 

• FIEGARCH(p, d, q) models can describe not only the same characteristics as ARCH, GARCH 
and EGARCH models do, but also the long-memory in the volatility. Also, the performance 
of all models will be very similar if the volatility presents high persistence. For instance, 
notice that for the ARCH(6) model ati + ■ ■ ■ + a 6 = 0.812, for the GARCH(1,1) model 
011 + Pi = 0.9678 and for the EGARCH model j3i = 0.9373, which imply high persistence in 
the volatility. Moreover, for the FIEGARCH model, we found d = 0.3578 with standard error 
equal to 0.0810, which indicates that the parameter d is statistically different from zero and 
thus, there is evidence of long- memory in the volatility. 

• Given their definition, it is expected that EGARCH and FIEGARCH models will provide 
better forecasts for ln(cr|, h ) than for <J^ +h and, consequently, for X^ +h . 
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6 Conclusions 

Here we show complete mathematical proofs for the stationarity, the ergodicity, the conditions 
for the causality and invertibility properties, the autocorrelation and spectral density functions 
decay and the convergence order for the polynomial coefficients that describe the volatility for 
any FIEGARCH(p, q, d) process. We prove that if {X t }tez is a FIEGARCH(p, d, q) process and 
E([ln(Zg)] 2 ) < oo, then {ln(A" 2 )} teZ is an ARFIMA(g, d, 0) process with correlated innovations. Ex- 
pressions for the kurtosis and the asymmetry measures of any stationary FIEGARCH(p, d, q) process 
were also provided. 

We also prove that if {A t } te ^ is a FIEGARCH(p, d, q) process then, it is a martingale difference 
with respect to the filtration {Ft}tei-, where T t ■= cr({Z s } s < t ). The /i-step ahead forecast for the 
processes {X t } teZ , {ln(of)} teZ and {ln(X t 2 )} teZ are given with their respective mean square error 
forecast. Since E^.J-Ft) cannot be easily calculated for FIEGARCH models, we also discuss 
some alternative estimators for the /i-step ahead forecast of crf +h , for all h > 0. 

We present a Monte Carlo simulation study showing how to perform the generation, the esti- 
mation and the forecasting of five different FIEGARCH models. The parameter selection of these 
five models are related to the real time series analyzed in Prass and Lopes (2011a). Parameter 
estimation was performed by considering the well known quasi-likelihood method. We conclude 
that, given the complexity of FIEGARCH models, the quasi-likelihood method performs relatively 
well, which is indicated by the small bias, mae and mse values for the estimates. Regarding the 
/i-step ahead forecast for the processes {of } te ^ and {Af } tg ^, we observe that the mean absolute 
error of forecast decreases as the sample size increases. However, while the conditional variance is 
well estimated, which is indicated by the small mae values, the estimator X^ +h := of +/l , which is 
an approximation for Xf +h := E(A 2 +ft |J r „) = <J 2 n+h , does not perform well in predicting X^ +h . This 
result is expected since the purpose of the model is to forecast the logarithm of the conditional 
variance and not the process {Xt}tez itself. 

Finally, we present the analysis of the Sao Paulo Stock Exchange Index (Bovespa Index or 
IBovespa) log-return time series. We compared the forecasting performance of FIEGARCH models, 
fully described in this paper, with other ARCH-type models. All models presented a similar perfor- 
mance which was attributed to the fact that the ARCH, GARCH and EG ARCH models indicated 
high persistence in the volatility. 
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